Claims 

What is claimed is: 

1. A method o\ person tagging in an image processing system, the 
method comprising th*^ steps of: 

processing ^ sequence of images to generate a statistical 
model for each person to be tagged, the statistical model 
incorporating at least^ one appearance feature and at least one 
geometric feature of theytagged person; 

applying the modpl to at least one subsequent image in order 
to perform at least one of x a detection operation, a location operation 
and a tracking operation foV the tagged person; and 

controlling an action of the image processing system based 
on a result of the at least oke operation. 

2. The method of claim\ 1 wherein the sequence of images 
comprises a video segment. 

3. The method of claim 1 wherein the processing step further 
includes processing the sequence of Wages to generate a plurality of 
statistical models, each of the model^ corresponding to a particular 

^tagged person. 

4. The method of claim 1 wher&in the appearance feature 
comprises at least one of a color feature\and a texture feature. 

5. The method of claim 1 wherein the geometric feature comprises 
at least one of a region shape and a region position for a given one 
of a plurality of regions associated with the\statistical model. 

6. The method of claim 1 wherein the statistical model is 
generated at least in part by segmenting a given Vmage into a number 
N of different regions of similar appearance, 
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7. The method claim 1 wherein the statistical model generated 
for a given person comprises a likelihood probability function 
P (I I Q which indicated the likelihood that the person Q is present 



in a given image I 



\ 



8. The method of olaim 7 wherein the likelihood probability 
function P (I | Q for person^ Q is computed as 

P(l|Q) =\ £ P(* r |0)P(r|Q , 

A = 1,2,...N 

where R r is a function of the atueast one appearance feature and the 
at least one geometric feature, akd r is an index identifying one of 
N regions of similar appearance within the image I. 

9. The method of claim 1 whereiA the statistical model generated 
for a given person Q comprises a livelihood probability function 
P (I I T, Q , where T is a linear transformation used to capture global 
motion of the person in an image I, and A is a discrete variable used 
to capture local motion of the person at A given point in time 

10. The method of claim 9 wherein a location of the person is 
determined using the linear transformation T 



11. The method of claim 9 wherein a 
determined using the discrete variable £ . 



poW of the person is 



12. The method of claim 9 wherein the linear\ transformation T 
is used to obtain a sub-window of the image I that\is invariant to 
rotation and scale. 



13. The method of claim 9 wherein the linear transformation T is 
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implemented using A bilinear interpolation technique with a reference 
point x c in the image I, a rotation angle 0 f and a scaling factor s. 

14. The method o^ claim 9 wherein the local motion is modeled 
using a discrete set of states of the variable £ to capture 
M different poses of the person Q. 

V 

15. The method of ^laim 1 wherein the statistical model 
generated for a given person\Q and image I comprises a likelihood 
probability function \ 

\ 

p (i i t, 4, Q) = V p (pix i t, 4, n , 

pixk I 

where r is an index to regions of siAilar appearance and Wis a total 
number of such regions, r = 1, 2, . \. N, and 

P (pix | T, £, Q) = max [p (pix | At, 4, Q) P(r | 4, Q)l, 

r — If ... f N \ ^ 

where P (pix | r, T, 4, fi is the probability V>f observing pixel pix 
assuming that it belongs to an r-th region of\ the model on a pose 
and P(r I 4, Q is the prior probability of the \egion at that pose. 

16. The method of claim 15 wherein the Vegions of similar 
appearance include a dummy region having a constant probability as 
follows : 

p (P^ I r occlusion , T, 4, Q) P (r occlusion |£, Q) = P occl \ . 



17. The method of claim 15 wherein each of at least a\ subset of 
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the pixels of the yLmage I is characterized by a two-dimensional 
position vector x anci by an appearance feature vector f such that: 



\ 
\ 

_\ 



P (pix | r, T,\£, fi) = P (x | r, T, £, Q) P (f | r, T, £, Q. , 



\ 



where P(x|r, T,^Q and \p (f | r, T, £, n are approximated as Gaussian 
distributions over corresponding feature spaces. 

\ 

18. The method of claim 1 wherein the controlling step comprises 
generating an output of the\ image processing system based on the 
result of the at least one operation. 

19. The method of claim 1 wherein the controlling step comprises 
altering an operating parameter df the image processing system based 
on the result of the at least one \operation . 

20. An apparatus for use in providing person tagging in an image 
processing system, the apparatus comprising: 

a processor operative to prbcess a sequence of images to 
generate a statistical model for eadfo person to be tagged, the 
statistical model incorporating at least bne appearance feature and at 
least one geometric feature of the tagged\person, the processor being 
further operative to apply the model to atVLeast one subsequent image 
in order to perform at least one of a detection operation, a location 
operation and a tracking operation for the tkgged person, and further 
wherein an action of the image processing system is controlled based 
on a result of the at least one operation. 

21. An article of manufacture comprising V storage medium for 
storing one or more programs for use in providing feerson tagging in an 
image processing system, wherein the one or m\re programs when 
executed by a processor implement the steps of: 

processing a sequence of images to generate a statistical 
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model for each person\ to be tagged, the statistical model 
incorporating at least oije appearance feature and at least one 
geometric feature of the ta v gged person; and 

applying the model \to at least one subsequent image in order 
to perform at least one of a detection operation, a location operation 
and a tracking operation for tzhe tagged person; 

wherein an action 6f the image processing system is 
controlled based on a result of the at least one operation. 
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